Green ICR: Semi-Automated Census Record Indexing with Emphasis on Human Computer Interaction

Authors

  • Robert Clawson
  • William Barrett
Abstract

Human-based computation is an approach that utilizes the abilities and strengths of both humans and computers to achieve a symbiotic interaction that is stronger than either agent in isolation. We propose a system that amplifies the capacity of a human indexer by adding an intelligent handwriting recognition engine to the indexing process. This recognition engine will learn patterns in handwriting as the indexer works, and then will amplify the indexer by automatically labeling handwriting with similar patterns. The recognition engine may also prompt the user to label examples that will best help it to learn. Preliminary results show that applying handwriting recognition technology could significantly reduce the number of fields an indexer is required to hand label. As a byproduct, we also believe the proposed system will be more interactive and more enjoyable for the indexer.
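The loop described in the abstract (learn from the indexer's labels, auto-label fields with similar patterns, and prompt for the most helpful example) can be sketched as follows. This is a minimal illustration, not the authors' recognition engine: a nearest-neighbour match over made-up feature vectors stands in for handwriting recognition, and the names `nearest_label`, `index_fields`, `ask_human`, and the `threshold` value are all hypothetical.

```python
import math


def nearest_label(features, labeled):
    """Return (label, confidence) taken from the closest human-labeled example."""
    best_label, best_dist = None, math.inf
    for vec, label in labeled:
        d = math.dist(features, vec)
        if d < best_dist:
            best_label, best_dist = label, d
    # Assumed confidence measure: 1.0 at distance 0, falling toward 0 with distance.
    return best_label, 1.0 / (1.0 + best_dist)


def index_fields(fields, ask_human, threshold=0.8):
    """Label every field, asking the human only when the engine is unsure.

    fields: dict mapping a field id to a feature vector (a stand-in for a
    handwriting image). ask_human: callable returning the label for a field id.
    Returns (labels, number_of_fields_the_human_labeled).
    """
    pending = dict(fields)
    labeled = []   # (features, label) pairs confirmed by the human
    labels = {}
    asked = 0
    while pending:
        if labeled:
            scored = {fid: nearest_label(vec, labeled) for fid, vec in pending.items()}
            # Auto-label every field that closely matches a known pattern.
            for fid, (label, conf) in scored.items():
                if conf >= threshold:
                    labels[fid] = label
                    del pending[fid]
            if not pending:
                break
            # Prompt for the field the engine is least sure about.
            query = min(pending, key=lambda fid: scored[fid][1])
        else:
            query = next(iter(pending))
        label = ask_human(query)
        labeled.append((pending.pop(query), label))
        labels[query] = label
        asked += 1
    return labels, asked


# Toy data: two clusters of similar "handwriting", one per surname.
fields = {"a1": (0.0, 0.0), "a2": (0.1, 0.0), "a3": (0.0, 0.1),
          "b1": (5.0, 5.0), "b2": (5.1, 5.0)}
truth = {"a1": "Smith", "a2": "Smith", "a3": "Smith", "b1": "Jones", "b2": "Jones"}
labels, asked = index_fields(fields, ask_human=truth.get)
```

In this toy run the simulated indexer labels one field per cluster and the remaining fields are filled in automatically. A real system would also need a way for the indexer to review and correct automatic labels.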

Similar resources

Intelligent indexing: a semi-automated, trainable system for field labeling

We present Intelligent Indexing: a general, scalable, collaborative approach to indexing and transcription of non-machine-readable documents that exploits visual consensus and group labeling while harnessing human recognition and domain expertise. In our system, indexers work directly on the page, and with minimal context switching can navigate the page, enter labels, and interact with the recog...

Using a Hidden-Markov Model in Semi-Automatic Indexing of Historical Handwritten Records

Indexing of historical records is a process that uses human effort to read text images and convert them into a machine readable format that facilitates search. The Church of Jesus Christ of Latter-day Saints has been using volunteers to index millions of microfilm images of genealogy records collected throughout the world. This indexing process is time-consuming. We adapt a technique for holist...

A Semi-Automated Algorithm for Segmentation of the Left Atrial Appendage Landing Zone: Application in Left Atrial Appendage Occlusion Procedures

Background: Mechanical occlusion of the Left atrial appendage (LAA) using a purpose-built device has emerged as an effective prophylactic treatment in patients with atrial fibrillation at risk of stroke and a contraindication for anticoagulation. A crucial step in procedural planning is the choice of the device size. This is currently based on the manual analysis of the “Device Landing Zone” fr...

A Survey of Current Methods in Medical Image Segmentation

Image segmentation plays a crucial role in many medical imaging applications by automating or facilitating the delineation of anatomical structures and other regions of interest. We present herein a critical appraisal of the current status of semi-automated and automated methods for the segmentation of anatomical medical images. Current segmentation approaches are reviewed with an emphasis plac...

Evaluation of a Binary Semi-supervised Classification Technique for Probabilistic Record Linkage.

Background: The process of merging data of different data sources is referred to as record linkage. A medical environment with increased preconditions on privacy protection demands the transformation of clear-text attributes like first name or date of birth into one-way encrypted pseudonyms. When performing an automated or privacy preserving record linkage there might be the need of a binary cla...

Publication date: 2013